智能论文笔记

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Improving Document Image Understanding with Reinforcement Finetuning

Bao-Sinh Nguyen , Dung Tien Le , Hieu M. Vu , Tuan Anh D. Nguyen , Minh-Tien Nguyen , Hung Le

分类：计算机视觉 | 机器学习

2022-09-26

成功的人工智能系统通常需要大量标记的数据来从文档图像中提取信息。在本文中，我们研究了改善人工智能系统在理解文档图像中的性能的问题，尤其是在培训数据受到限制的情况下。我们通过使用加强学习提出一种新颖的填充方法来解决问题。我们的方法将信息提取模型视为策略网络，并使用策略梯度培训来更新模型，以最大程度地提高补充传统跨凝结损失的综合奖励功能。我们使用标签和专家反馈在四个数据集上进行的实验表明，我们的填充机制始终提高最先进的信息提取器的性能，尤其是在小型培训数据制度中。

translated by 谷歌翻译

SoftGroup++: Scalable 3D Instance Segmentation with Octree Pyramid Grouping

Thang Vu , Kookhoi Kim , Tung M. Luu , Thanh Nguyen , Junyeong Kim , Chang D. Yoo

分类：计算机视觉

2022-09-17

现有的最新3D点云实例分割方法依赖于基于分组的方法，该方法指向获得对象实例。尽管产生准确的分割结果方面有所改善，但这些方法缺乏可扩展性，通常需要将大量输入分为多个部分。为了处理数百万点的场景，现有的最快方法软组\ cite {vu2022222222222222222222222222222222222222ggroup}需要数十秒钟，这是满意的。我们的发现是，$ k $ neart的邻居（$ k $ -nn）是分组的先决条件，是计算瓶颈。这种瓶颈严重使现场的推理时间恶化了很多。本文提出了软组++来解决此计算瓶颈，并进一步优化了整个网络的推理速度。 SoftGroup ++建立在软组上，这在三个重要方面有所不同：（1）执行OCTREE $ K $ -NN而不是Vanilla $ k $ -nn，以将时间复杂性从$ \ Mathcal {o}（n^2）缩短到$ \ Mathcal {o}（n \ log n）$，（2）执行金字塔缩放，适应性下降样本骨干输出以减少$ k $ -nn和分组的搜索空间，并且（3）执行后期的Devoxelization，延迟了Voxels的转换指向模型的结束，以使中间组件以低计算成本运行。在各种室内和室外数据集上进行了广泛的实验，证明了拟议的软组++的功效。值得注意的是，SoftGroup ++在一个前方的情况下通过单个前方进行了大量的场景，而无需将输入分为多个部分，从而丰富了上下文信息。特别是，SoftGroup ++达到2.4点AP $ _ {50} $改进，而$ 6 \ $ 6 \ times $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $。代码和训练有素的模型将公开可用。

translated by 谷歌翻译

Learning to diagnose common thorax diseases on chest radiographs from radiology reports in Vietnamese

Thao T. B. Nguyen , Tam M. Vo , Thang V. Nguyen , Hieu H. Pham , Ha Q. Nguyen

分类：计算机视觉

2022-09-11

我们提出了一个数据收集和注释管道，该数据从越南放射学报告中提取信息，以提供胸部X射线（CXR）图像的准确标签。这可以通过注释与其特有诊断类别的数据相匹配，这些数据可能因国家而异。为了评估所提出的标签技术的功效，我们构建了一个包含9,752项研究的CXR数据集，并使用该数据集的子集评估了我们的管道。以F1得分为至少0.9923，评估表明，我们的标签工具在所有类别中都精确而始终如一。构建数据集后，我们训练深度学习模型，以利用从大型公共CXR数据集传输的知识。我们采用各种损失功能来克服不平衡的多标签数据集的诅咒，并使用各种模型体系结构进行实验，以选择提供最佳性能的诅咒。我们的最佳模型（CHEXPERT-FRECTER EDIDENENET-B2）的F1得分为0.6989（95％CI 0.6740，0.7240），AUC为0.7912，敏感性为0.7064，特异性为0.8760，普遍诊断为0.8760。最后，我们证明了我们的粗分类（基于五个特定的异常位置）在基准CHEXPERT数据集上获得了可比的结果（十二个病理），以进行一般异常检测，同时在所有类别的平均表现方面提供更好的性能。

translated by 谷歌翻译

CathSim: An Open-source Simulator for Autonomous Cannulation

Tudor Jianu , Baoru Huang , Mohamed E. M. K. Abdelaziz , Minh Nhat Vu , Sebastiano Fichera , Chun-Yi Lee , Pierre Berthet-Rayne , Ferdinando Rodriguez y Baena , Anh Nguyen

分类：机器人

2022-08-02

血管内操作中的自主机器人有可能安全可靠地浏览循环系统，同时降低对人体错误的敏感性。但是，训练机器人的过程涉及许多挑战，例如由于机器学习算法的效率低下而导致的长期培训持续时间以及导管与血管内幻影之间的相互作用引起的安全问题。物理模拟器已在血管内手术的背景下使用，但通常用于员工培训，通常不符合自主插管目标。此外，大多数当前的模拟器都是封闭消息，它阻碍了安全可靠的自主系统的协作开发。在这项工作中，我们介绍了Cathsim，Cathsim是一种开源模拟环境，可加快用于自主内血管内导航的机器学习算法的开发。我们首先使用最先进的血管内机器人模拟高保真导管和主动脉。然后，我们在模拟环境中提供了导管和主动脉之间实时力传感的能力。我们通过使用两种流行的强化学习算法，近端策略优化（PPO）和软参与者（SAC）在两个主要动脉内执行两个不同的导管插入任务来验证我们的模拟器。实验结果表明，使用我们的开源模拟器，我们可以成功训练增强型学习剂以执行不同的自主插管任务。

translated by 谷歌翻译

Robust Rayleigh Regression Method for SAR Image Processing in Presence of Outliers

B. G. Palm , F. M. Bayer , R. Machado , M. I. Pettersson , V. T. Vu , R. J. Cintra

分类：机器学习

2022-07-29

合成孔径雷达（SAR）数据中的异常值（异常值）的存在以及统计图像模型中的错误指定可能导致推断不准确。为了避免此类问题，提出了基于强大的估计过程的瑞利回归模型，作为模拟此类数据的更现实的方法。本文旨在获得瑞利回归模型参数估计量与异常值的存在。提出的方法考虑了加权最大似然法，并使用模拟和测量的SAR图像提交了数值实验。使用蒙特卡洛模拟来评估有限信号长度中提出的可靠估计器性能，对离群值的敏感性和分解点。例如，非稳定估计器显示相对偏置值$ 65 $ - 折叠比损坏信号中强大方法提供的结果大。在灵敏度分析和分解点方面，强大的方案在两种措施的平均绝对值中分别降低了约96美元\％$和$ 10 \％$，以同情非稳定估计器。此外，使用两个SAR数据集比较了所提出的强稳定方案的地面类型和异常检测结果与文献中的竞争方法。

translated by 谷歌翻译

Developing an AI-enabled IIoT platform -- Lessons learned from early use case validation

Holger Eichelberger , Gregory Palmer , Svenja Reimer , Tat Trong Vu , Hieu Do , Sofiane Laridi , Alexander Weber , Claudia Niederée , Thomas Hildebrandt

分类：人工智能

2022-07-10

为了在工业生产中更广泛地采用AI，足够的基础设施能力至关重要。这包括简化AI与工业设备的集成，对分布式部署，监视和一致的系统配置的支持。现有的IIOT平台仍然缺乏以开放的，基于生态系统的方式灵活整合可重复使用的AI服务和相关标准（例如资产管理壳或OPC UA）的功能。这正是我们采用高度可配置的基于低代码的方法来解决我们下一个级别的智能工业生产生产生产Ecosphere（IIP-Ecosphere）平台所解决的问题。在本文中，我们介绍了该平台的设计，并根据启用AI支持的视觉质量检查的演示者讨论了早期评估。在这项早期评估活动中，学到的见解和教训补充了这一点。

translated by 谷歌翻译

QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation -- Analysis of Ranking Metrics and Benchmarking Results

Raghav Mehta , Angelos Filos , Ujjwal Baid , Chiharu Sako , Richard McKinley , Michael Rebsamen , Katrin Dätwyler , Raphael Meier , Piotr Radojewski , Gowtham Krishnan Murugesan

分类：计算机视觉 | 机器学习

2021-12-19

深度学习（DL）模型为各种医学成像基准挑战提供了最先进的性能，包括脑肿瘤细分（BRATS）挑战。然而，局灶性病理多隔室分割（例如，肿瘤和病变子区）的任务特别具有挑战性，并且潜在的错误阻碍DL模型转化为临床工作流程。量化不确定形式的DL模型预测的可靠性，可以实现最不确定的地区的临床审查，从而建立信任并铺平临床翻译。最近，已经引入了许多不确定性估计方法，用于DL医学图像分割任务。开发指标评估和比较不确定性措施的表现将有助于最终用户制定更明智的决策。在本研究中，我们探索并评估在Brats 2019-2020任务期间开发的公制，以对不确定量化量化（Qu-Brats），并旨在评估和排列脑肿瘤多隔室分割的不确定性估计。该公制（1）奖励不确定性估计，对正确断言产生高置信度，以及在不正确的断言处分配低置信水平的估计数，（2）惩罚导致更高百分比的无关正确断言百分比的不确定性措施。我们进一步基准测试由14个独立参与的Qu-Brats 2020的分割不确定性，所有这些都参与了主要的Brats细分任务。总体而言，我们的研究结果证实了不确定性估计提供了分割算法的重要性和互补价值，因此突出了医学图像分析中不确定性量化的需求。我们的评估代码在HTTPS://github.com/ragmeh11/qu-brats公开提供。

translated by 谷歌翻译

Conservation Tools: The Next Generation of Engineering--Biology Collaborations

Andrew Schulz , Cassie Shriver , Suzanne Stathatos , Benjamin Seleb , Emily Weigel , Young-Hui Chang , M. Saad Bhamla , David Hu , Joseph R. Mendelson III , .

分类：机器学习

2023-01-03

The recent increase in public and academic interest in preserving biodiversity has led to the growth of the field of conservation technology. This field involves designing and constructing tools that utilize technology to aid in the conservation of wildlife. In this article, we will use case studies to demonstrate the importance of designing conservation tools with human-wildlife interaction in mind and provide a framework for creating successful tools. These case studies include a range of complexities, from simple cat collars to machine learning and game theory methodologies. Our goal is to introduce and inform current and future researchers in the field of conservation technology and provide references for educating the next generation of conservation technologists. Conservation technology not only has the potential to benefit biodiversity but also has broader impacts on fields such as sustainability and environmental protection. By using innovative technologies to address conservation challenges, we can find more effective and efficient solutions to protect and preserve our planet's resources.

translated by 谷歌翻译

Meta-learning generalizable dynamics from trajectories

Qiaofeng Li , Tianyi Wang , Vwani Roychowdhury , M. Khalid Jawed

分类：机器学习

2023-01-03

We present the interpretable meta neural ordinary differential equation (iMODE) method to rapidly learn generalizable (i.e., not parameter-specific) dynamics from trajectories of multiple dynamical systems that vary in their physical parameters. The iMODE method learns meta-knowledge, the functional variations of the force field of dynamical system instances without knowing the physical parameters, by adopting a bi-level optimization framework: an outer level capturing the common force field form among studied dynamical system instances and an inner level adapting to individual system instances. A priori physical knowledge can be conveniently embedded in the neural network architecture as inductive bias, such as conservative force field and Euclidean symmetry. With the learned meta-knowledge, iMODE can model an unseen system within seconds, and inversely reveal knowledge on the physical parameters of a system, or as a Neural Gauge to "measure" the physical parameters of an unseen system with observed trajectories. We test the validity of the iMODE method on bistable, double pendulum, Van der Pol, Slinky, and reaction-diffusion systems.

translated by 谷歌翻译